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■ Abstract 

0^ \ In combinatorics on words, a word w over an alphabet S is said to avoid a pattern p 

over an alphabet A if there is no factor f oiw such that / = h{p) where h : A* — t- S* 
is a non-erasing morphism. A pattern p is said to be /c-avoidable if there exists an 
infinite word over a fe-letter alphabet that avoids p. We give a positive answer to 
Problem 3.3.2 in Lothaire's book 'Algebraic combinatorics on words", that is, every 
Q ■ pattern with k variables of length at least 2^ (resp. 3 x 2^~^) is 3-avoidable (resp. 

2-avoidable) . This improves previous bounds due to Bell and Goh, and Rampersad. 

^ . Key words: Word; Pattern avoidance. 
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1 Introduction 

o 

' A pattern p is a non-empty word over an alphabet A = {A, 5, C, . . . } of 

capital letters called variables. A word x over S is an instance of p if there exists 
a non-erasing morphism : A* — t- S* such that h{p) = x. The avoidability 
^ I index X{p) of a pattern p is the size of the smallest alphabet S such that 

there exists an infinite word w over S containing no instance of p as a factor. 
Bean, Ehrenfeucht, and McNulty [1] and Zimin [15] characterized unavoidable 
patterns, i.e., such that \{p) = oo. We say that a pattern p is t-avoidable if 
^{p) < t. For more informations on pattern avoidability, we refer to Chapter 
3 of Lothaire's book [7]. 

In this paper, we consider upper bounds on the avoidability index of long 
enough patterns with k variables. Bell and Goh [2] and Rampersad [11] used 
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a method based on power series and obtained the following bounds: 
Theorem 1 ([2,11]) Let p be a pattern with exactly k variables. 

(a) Ifp has length at least 2'' then X{p) < 4. [2] 

(b) Ifp has length at least 3'' then \{p) < 3. [11] 

(c) If p has length at least 4*^ then X{p) —2. [11] 

Our main result improves these bounds: 

Theorem 2 Let p be a pattern with exactly k variables. 

(a) Ifp has length at least 2}^ then A(p) < 3. 

(b) If p has length at least 3 x 2}^"^ then A(p) = 2. 

Theorem 2 gives a positive answer to Problem 3.3.2 of Lothaire's book [7]. 

The bound 2}^ in Theorem 2. (a) is tight in the sense that for every k > 
1, the pattern pk with k variables in the family {A, ABA, ABAC ABA, 
ABACABADABACABA, . . . } has length 2^= - 1 and is unavoidable. Sim- 
ilarly, the bound 3 x 2^^"^ in Theorem 2.(b) is tight in the sense that for every 
A; > 1, the pattern with k variables in the family {AA, AABAA,AABAACAABAA, 
AABAACAABAADAABAACAABAA, . . . } has length 3 x 2'"'^ - 1 and is 
not 2-avoidable. Hence, this shows that the upper bound 3 of Theorem 2. (a) 
is best possible. 

The avoidability index of every pattern with at most 3 variables is known, 
thanks to various results in the literature. In particular. Theorem 2 is proved 
for A; < 3: 

• For = 1, the famous results of Thue [13,14] give X{AA) = 3 and X{AAA) = 
2. 

• For k = 2, every binary pattern of length at least 4 contains a square, and 
is thus 3-avoidable. Moreover, Roth [12] proved that every binary pattern 
of length at least 6 is 2-avoidable. 

• For A; = 3, Cassaigne [4] began and the first author [9] finished the determi- 
nation of the avoidability index of every pattern with at most 3 variables. 
Every ternary pattern of length at least 8 is 3-avoidable and every binary 
pattern of length at least 12 is 2-avoidable. 

So, there remains to prove the cases A; > 4. 

Section 2 is devoted to some preliminary results. We prove Theorem 2. (a) in 
Section 3 as a corollary of a result of Bell and Goh [2]. In Section 4, we prove 
Theorem 2.(b) using the so-called entropy compression method. 

2 PreUminctry results 

Let p be a pattern over A = {A, B,C, . . .}. An occurrence y of p is an assig- 
nation of a non-empty words over S to every variable of p that form a factor. 
Note that two distinct occurrences of p may form the same factor. For exam- 
ple, if p = ABA then the occurrence y = {A = 00; B = 1) of p forms the factor 



2 



00100; on the other hand, y' = {A = 0; B — 010) is a distinct occurrence of p 
which forms the same factor 00100. 

A pattern p is doubled if every variable of p appears at least twice in p. 

A pattern p is balanced if it is doubled and every variable of p appears both in 
the prefix and the suffix of length ^ of P- Note that if the pattern has odd 
length, then the variable X that appears in the middle of p (i.e. in position 

■yJ + 1) must appear also in the prefix and in the suffix in order to make p 

balanced. 

Claim 1 For every integer f >2, every pattern with at most k variables and 
length at least f x 2'^~^ contains a balanced pattern p' with at most k' > 1 
variables and length at least f x 2^ as a factor. 

Proof. We prove this claim by induction on A;. If A; = 1, then p has size at 
least / > 2 and is clearly balanced. Suppose this is true for some k = n, i.e. p 
with n variables and length at least / x 2"^^ contains a balanced pattern p' as 
a factor with at most k' variables and length at least f x 2^ Let k = n+1 
and let pi (resp. P2) be the prefix (resp. the suffix) of p of size y\- P 
not balanced, then there exists a variable X in p that does not occur in pi for 
some i G {1,2}. Thus, pi has at most k — 1 — n variables and length at least 
/ X 2"^^. Therefore, by induction hypothesis, p contains a balanced pattern 
with at most k' variables and length at least / x 2'' as a factor. □ 

In the following, we will only use the fact that the pattern p' in Claim 1 is 
doubled instead of balanced. 

3 3-avoidabIe long patterns 

We prove Theorem 2. (a) as a corollary of the following result of Bell and 
Goh [2]: 

Lemma 3 ([2]) Every doubled pattern with at least 6 variables is 3-avoidable. 

Proof of Theorem 2. (a). We want to prove that every pattern with exactly 
k variables and length at least 2^ is 3-avoidable, or equivalently, that every 
pattern with at most k variables and length at least 2^ is 3-avoidable. By 
Claim 1, every such pattern contains a doubled pattern p' as a factor with 
at most A;' > 1 variables and length at least 2^' . So there remains to show 
that every doubled pattern with at most k variables and length at least 2^ 
is 3-avoidable. As discussed in the introduction, the case of patterns with at 
most 3 variables has been settled. Now, it is sufficient to prove that doubled 
patterns of length at least 2^ = 16 are 3-avoidable. 

Suppose that pi is a doubled pattern containing a variable X that appears 
at least 4 times. Replace 2 occurrences of X with a new variable to ob- 
tain a pattern p2. Example: We replace the first and third occurrence of 
B in pi = ABBCDBCABDDCB by a new variable E to obtain p2 = 
AEBCDECABDDCB. Then p2 is a doubled pattern such that \pi\ = \p2\ 
and A(pi) < \{p2), since every occurrence of pi is also an occurrence of p2- 
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Given a doubled pattern p of length at least 16, we make such replacements as 
long as we can. We thus obtain a doubled pattern p' of length at least 16 such 
that A(p) < A(p'). Moreover, every variable in p' appears either 2 or 3 times 
and therefore p' contains at least [16/3] = 6 variables. So p' is 3-avoidable by 
Lemma 3. Thus p is 3-avoidable, which finishes the proof. □ 

4 2-avoidable long patterns 

We want to prove that every pattern with exactly k variables and length at 
least 3 X 2''^^ is 2-avoidable, or equivalently, that every pattern with at most 
k variables and length at least 3 x 2^'^^ is 2-avoidable. By Claim 1, every such 
pattern contains a doubled pattern p' as a factor with at most k' > 1 variables 
and length at least 3 x 2^^'"^. So there remains to show that every doubled 
pattern with at most k variables and length at least 3 x 2^^"^ is 2-avoidable. 

As discussed in the introduction, the case of patterns with at most 3 variables 
has been settled. Now, it is sufficient to prove Theorem 2.(b) for doubled 

patterns and k > 4. 

Let S = {0, 1} be the alphabet. For the remaining of this section, let /c > 4 
and q{k) = 3 x 2''-^. 

Suppose by contradiction that there exists a doubled pattern p on k variables 
and length at least q{k) that is not 2-avoidable. Then there exists an integer 
n such that any word w G contains p. We put an arbitrary order on the k 
variables of p and call Aj the j-th variable of p. 

4-1 The algorithm AvoidPattern 

Let V = {0, 1}* be a vector of length t. The following algorithm takes the 
vector V as input and returns a word w avoiding p and a data structure R 
that is called a record in the remaining of the paper. 

Algorithm 1: AvoidPattern 
Input : V. 

Output: w (a word avoiding p) and R (a data structure). 

w e 
R^d) 

for i •<— 1 to t do 

Append V[i] (the i-th letter of V) to w 
Encode in R that a letter was appended to w 
if w contains an occurrence of p of length i then 

Encode in R the occurrence of p 

Erase the suffix of length £ oi w 

return i?, w 



The way we encode information in R at lines 5 and 7 will be explained in 
Subsection 4.2. 



In the algorithm AvoidPattern, let Wi be the word w after i steps. Clearly, Wi 
avoids p at each step. By contradiction hypothesis, the resulting word w of the 
algorithm (that is Wf) has length less than n. We will prove that each output of 
the algorithm allows to determine the input. Then we obtain a contradiction 
by showing that the number of possible outputs is strictly smaller than the 
number of possible inputs when t is chosen large enough compared to n. This 
implies that every pattern p with at most k variables and length at least q{k) 
is 2-avoidablc. 

To analyze the algorithm, we borrow ideas from graph coloring problems [5,6]. 
These results are based on the Moser-Tardos [8] entropy-compression method 
which is an algorithmic proof of the Lovasz Local Lemma. 

4-2 The record R 

An important part of the algorithm is to keep the record R of each step of the 
algorithm. Let Ri be the record after i steps of the algorithm AvoidPattern. 
On one hand, given V as input of the algorithm, this produces a pair (i?^, Wt). 
On the other hand, given a pair {Rf, Wt), we will show in Lemma 4 that we can 
recover the entire input vector y. So, each input vector V produces a distinct 
pair {Rt,wt). 

Let V be the set of input vectors V of size t, let TZ be the set of records R 
produced by the algorithm AvoidPattern and let ^ be the set of different 
outputs {Rt,Wt). After the execution of the algorithm (t steps), Wt avoids p 
by definition and therefore \wt\ < n by contradiction hypothesis. Hence, the 
number of possible final words wt is independent from t (it is at most 2"). 
We then clearly have |(^| < 2" x |7^|. We will prove that |V| < and that 
\n\ = o(2*) to obtain the contradiction 2* = |V| < |^| < 2" x \n\ = o(2*). 

The record i? is a triplet R = {D,L,X) where D is a binary word (each 
element is or 1), L is a vector of {k — l)-sets of non-zero integers and X is 
a vector of binary words. At the beginning, D, L and X are empty. At step i 
of the algorithm, we append V[i\ to Wi-i to get w'^. 

If w!j^ contains no occurrence of p, then we append to to get Ri and we 
set Wi — w[. Otherwise, suppose that w[ contains an occurrence y of p that 
forms a factor / of length i (/ is the i last letters of w'^. Recall that Aj is the 
j-th variable of p. Let £j = \Aj\ in the factor /, let Li = £i, Lj = Lj_i + £,j 
for j > 2. Let L' = {Li, L2, . . . , Lk-i} be a (A; — l)-set of non-zero integers. 
Let X' be the binary word obtained from Ai ■ A2 ■ . . . ■ (where "•" is the 
concatenation operator) followed by as many O's as necessary to get length 
I J . Note that we necessarily have \Ai ■ A2 • . . . • Ak\ < |J since the pattern 
is doubled. Eventually, to get Ri, we append the factor 01^ to D; we add L' 
as the last element of L; finally we add X' as the last clement of X. 

Example: Let us give an example with k = 3, p = ACBBCBBABCAB and 
V = [0, 0, 1, 0, 0, 1, L 0, 0, L L L 0, 0, L L 0, 1, 1, 1, 0, 0, 0, L L 0]. The variables 
of p were initially ordered as {A,B,C). For the first 24 steps, no occurrence 
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of p appeared, so at each step i < 24, we append V[i] to Wi-i and we append 
one to D. Hence, at step 24, we have: 

• W24 = 001001100111001101110001 
D = 000000000000000000000000 = 0^4 

x = [] 

Now, at step 25, we first append V[25] = 1 to W24 to get 1025 • The word 
contains an occurrence y = {A = 01; B — 1;C — 100) of p which forms a 
factor of length 21 (the 21 last letters of W25). Then we set L' — {2,3} and 
X' = 0111000000 (this is A - B - C followed by four O's in order to get a binary 
word of length 10 = ^ )• Eventually, to get W25 and /?25- we erase the suffix 
of length 21 of W25 get W25, we append the factor 01^^ to D, L' to L, and 
X' to X. This gives: 

• W25 = 0010 

D = 0000000000000000000000000111111111111111111111 = 02^121 
i?25 = <( L = [{2, 3}] 

X = [0111000000] 
This is where our example ends. 

Let Vi be the vector V restricted to its i first elements. We will show that the 
pair {Ri,Wi) at some step i allows to recover V^. 

Lemma 4 After i steps of the algorithm AvoidPattern, the pair {Ri,Wi) 
permits to recover Vi. 

Proof. Before step 1, we have wq — e, Rq — (e, [ ], [ ]), and Vq — e. Let 
Ri — [D, L, X) be the record after step i, with 1 <i <t. 

• Suppose that is a suffix of D. This means that at step i, no occurrence of 
p was found: the algorithm appended V[i\ to Wj-i to get wi. Therefore V[{\ 
is the last letter of Wj, say x. Then the word is obtained from Wi by 
erasing the last letter and the record Ri^i is obtained from R^ by removing 
the suffix of D. We recover from (-Rj-i, Wi^i) by induction hypothesis 
and we obtain — T^_i • x. 

• Suppose now that 01^ is a suffix of D. This means that an occurrence y oi p 
which forms a factor / of length i has been created during step i. The last 
element L' of L is a (A; — l)-set L' = {Li, L2, .... Lk-i} and the last element 



X' of X is a binary word of length 
let is — Lg — Lg-i. So, for 1 < s < 



Let £1 = Li and for 2 < s < /c — 1, 
— 1, £5 is clearly the length of the 



variable Ag of p in the occurrence y by construction of L' . We know the 
pattern p, the total length of the factor / (that is t) and the lengths of the 
k — 1 first variables of p in /, so we are able to compute the length Ik of 
the last variable Aj.. So we are now able to recover the occurrence y of p: 
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the first £i letters of X' correspond to Ai, the next £2 letters correspond to 
A2 and so on {X' may contain some O's at the end which are not relevant). 
It follows that the factor / is completely determined. So is obtained 
from Wj • / by removing the last letter x of /, this letter x being V[i] (the 
appended letter to Wi-i at step i to get w'^). The record Ri-i is obtained 
from Ri as follows: remove the suffix 01^ from D; remove the last clement of 
L and the last element of X. We recover V^_i from {Ri-i, Wi-i) by induction 
hypothesis and we obtain Vi = Vi_i • x. 

□ 

The previous lemma proves that distinct input vectors cannot correspond to 
the same pair {Rt,Wt). So we get |V| < \&\. 

4-3 Analysis ofTZ 

Now we compute \TZ\. Let R — Rt — {D, L, X) be a given record produced by 
an execution of AvoidPattern. Let P, C and X be the set of such binary 
words D, of such {k — l)-scts of non-zero integers L, and of such vectors of 
binary words X, respectively. We thus have \R\ < \D\ x |£| x \X\. 

Let us give some useful information in order to get upper bounds on \'D\^ \X\^ 
and \C\. The algorithm runs in t steps. At each step, one letter is appended 
to w, so t letters have been appended and therefore the number of erased 
letters during the execution of the algorithm is t — \wt\. At some steps, an 
occurrence of p appears and forms a factor which is immediately erased. Let 
m be the number of erased factors during the execution of the algorithm. Let 
fij^^i^m, be the m erased factors. We have \ fi\ > q{k) since each variable 
of p is a non-empty word and p has length at least q{k). Moreover, we have 
Si<«<m \fi\ — t — \wt\ <t. Each time a factor /j is erased, we add an element 
to L and X, so \L\ — \X\ — m. 

4.3.1 Analysis ofV 

In the binary word D, each corresponds to an appended letter during the 
execution of the algorithm and each 1 corresponds to an erased letter. There- 
fore, D has length 2t—\w-t\. Observe that every prefix in D contains at least as 
many O's as I's. Indeed, since a 1 corresponds to an erased letter this letter 
x had to be added first and thus there is a before that corresponds to this 1. 
The word D is therefore a partial Dyck word. Since any erased factor fi has 
length at least q{k), any maximal sequence of I's (which is called a descent in 
the sequel) in D has length at least q{k). So D is a partial Dyck words with t 
O's such that each descent has length at least q{k). The following two lemmas 
due to Esperet and Parreau [6] give an upper bound on \'D\. 

Let Ct.r,d (resp. Ct,d) be the number of partial Dyck words with t O's and t — r 
I's (resp. Dyck words of length 2t) such that all descents have length at least 
d. Hence, we have I'PI < Ct,\w^iq(k). 

Lemma b [6] Ct,r,d < Ct+d-i,d- 
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Hence, we have |X>| < Ct+q^k)-i,q(k)- 
Let0d(x) = l + Ei><i2;^ = l + 1^. 

Lemma 6 [6] Let d he an integer such that the equation (pd{x) ~ ^(p'di^) = 
has a solution r with < t < r, where r is the radius of convergence of 
(pd- Then r is the unique solution of the equation in the open interval (0,r). 
Moreover, there exists a constant Cd such that Ctd ^ (^dld^~^ where jd — 

The solution of the equation (j)d{x) — X(f)'^{x) = is equivalent to P{x) = 
(1 — xy + (1 — d)x'^ + {d — 2)x'^~^^ = 0. The radius of convergence r of 4>d is 
1 and since P(0) = 1 and P(r) = —1, P{x) — has a solution r in the open 
interval (0, r). By Lemma 6, this solution is unique and, for some constant q, 
we have Ct+d-iA < Cd'^*^'^~^ {t -\- d — 1)~2 with 7^ = 0^(r). We clearly have 
Ct+d-14 = o(7d)- So, we can compute 7^^ for d fixed. Wc will use the following 
bounds: 724 < 1.27575, 740 < 1.15685, and 7100 < 1.08603. Note that when d 
increases, 7^ decreases. 

So, by Lemmas 5 and 6, when t is large enough, we have \D\ < 1.27575* (rcsp. 
\V\ < 1.15685*, \V\ < 1.08603*) if the length of any descent is at least 24 (resp. 
48, 100). 

4.3.2 Analysis of X 

Each element X' of X corresponds to an erased factor /j and by construction 



\X'\ -- 
...+ 



f\ - So the sum of the lengths of the elements of X is 



1/1 1 


+ 


I/2I 


2 




2 



+ 



at most 



2 

' t 



< 



. Thus, the vector X corresponds to a binary word of length 



. Therefore \X\ < 2^^] < 



4.3.3 Analysis of C 

Each element L' = {Li, L2, . . . , L^-i} of L corresponds to an erased factor /j 
and by construction each Lj e L' corresponds to the sum of the lengths of the 
j first variables of p in /j. 

Let hk{i) be the number of such (fc — l)-scts L' that correspond to factors 
of length £. Recall that \fi\ > q{k), so hk{t) is defined for A; > 4 and £ > 
q{k). Each of the m elements of L corresponds to an erased factor, so |£| < 
hk{\fi\) X hk{\f2\) X ... X hk{\fm\)- Let gk{tj = hk{tj-t defined for /c > 4 and 
£ > q{k). Then C < (?fc(|/i|)l^^l x (?fc(|/2|)l^^l x . . . x ^fc(|/™|)l^™l. So, if we are 
able to upper-bound gk{(^) by some constant c for all £ > q{k), then we get 
|£| < d-'^il X cl-'^^l X ... X d-^-l < c*. 

Now we bound gk{£) using two different methods depending on the value of k 
and the length q{k) of p. 

4.3.3.1 Bound on gk{i) for k = 4, i > 100 or k>5, £>48 

For any factor fi, we have \Ai ■ A2 ■ . . . ■ Ak\ < Ll/«l/2j since p is doubled 
(each variable appears at least twice). For a given L' = {Li, L2, . . . , Lk-i} 
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that corresponds to some factor /i, we have Lk-i = \Ai ■ • . . . • < 
[|/j|/2j. Therefore, L' is a {k — l)-set of distinct non-zero integers at most 
[|/j|/2j, i.e. k — 1 integers chosen among integers between 1 and [|/i|/2j; 

thus hk{i) < (Lfc/^i^) and so gk{e) < (^^5^)'- We can upper-bound gk{l) by 
= I {k-iy. J ^ The function gk{i) is decreasing for £ > q{k); 

so Qkii) < < M<l{k)) = (^ ^^jjl j ^ for all i > q{k). Moreover, we 

can sec that 'gk{q{k)) < ^(48) for all k > 5. 

Computing this upper bound, we get gk{i) < ^(?(^)) < ^(48) < 1.21973 for 
all A; > 5 and £ > 48 and g^ii) < ^(100) < 1.10456 for all i > 100. 

4.3.3.2 Bound on g^ii) for 24 < £ < 99 

The second method to bound the size of gA{£) is based on ordinary generating 
functions (OGF). Here, A; = 4, so let Ai, A2, A^, A4 be the four variables of p 
and let Oj be the number of apparitions of A^ in p. Therefore, 01 + 02 + 03-1-04 = 
\p\. Recall that each variable appears at least twice in p since p is doubled, 
so Oj > 2. Moreover, a factor of length i, with 24 < £ < 99, is necessarily 
an occurrence of a pattern of length between 24 and 99. So we just have to 
consider patterns p with 24 < \p\ < 99. 

Given L' = {Li, L2, L^} an element of L that corresponds to some factor /j, we 
can compute the lengths of each variable A^ in [ii = Li, = Lj — Lj_i for 
i e {2, 3} and £4 = lIihi2lil±^2il±mMy Recall that > 1 since each variable 
of p is a non-empty word. Let A\p\ — J2i>\p\ h be the OGF of such sets L', i.e. 
hi is the number of 3-sets {Li, L2, L3} that corresponds to a factor of length i 
formed by an occurrence of a pattern of length \p\ (that is hi is the number of 
4-tuples (^1,^2, 4, ^4) with£j > 1 such that Oix£i+02x£2+a3x£3+04x£4 = i). 
So by definition of h/^, we have /i4(£) = h^. 

This kind of OGF has been studied and is similar to the well-known problem 
of counting the number of ways you can change a dollar [10] : you have only five 
types of coins (pennies, nickels, dimes, quarters, and half dollars) and you want 
to count the number of ways you can change any amount of cents. So, let C — 
Hi>i Ci be the OGF of the problem and thus any q is the number of ways 
you can change i cents. Then, for example, Cioo corresponds to the number of 
ways you can change a dollar. Here, C = x x x x 

In our case, we have four coins, each of them has value Oj (so we can have 
different types of coins with the same value) and each type of coins appears 
at least once (since £i>l). Thus we get A\p\ — J2i>\p\ h — ^ x 
X Tz^- Maple for our computation. For each 24 < \p\ < 99, for 

each 4-tuple (01,02,03,04) such that X^c^i = b|, we consider the associated 
OGF ^IpI and we compute, using Maple, the truncated series expansion up 
to the order 100, that gives A\p\ = h24X^^ + 6252;^^ + . . . + 6992;^^ + 0(a;^°°) 
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with explicit values for the coefficients 6j. So, for any 2A < i < 99, g4{i) is 
upper-bounded by the maximum of [b^ji taken oven all A\p\. Maple gives that 
(bi)^ is maximal for \p\ = 24, (ai, 02, 03, 04) = (2, 2, 2, 18), and £ = 46: in this 
case, 646 = 84 (i.e. there exist 84 distinct 3-sets L' that correspond to some 
factor of length 46 formed by an occurrence of a pattern of length 24). So, 
g^ii) < 84^ < 1.10112 for all 24 < £ < 99, A; = 4 and 24 < \p\ < 99. 

4.3.3.3 Bound g{£) for all k > A 

We can deduce from Paragraphs 4.3.3.1 and 4.3.3.2 the following. 

If A; = 4, then g^^i) < 1.10112 for 24 < £ < 99 and g^ii) < 1.10456 for i > 100. 
So for A; = 4, we have |£| < (1.10456)*. 

If A; > 5, then gk{i) < 1.21973 for i > q{k). So for A; > 5, we have \C\ < 
(1.21973)*. 

4.4 Conclusion 

Aggregating the above analysis, we get the following. For A; > 5, we have 
q{k) > 48: then |7^| < \V\ x \C\ x \X\ < (1.15685 x 1.21973 x ^^2)* = o(2*). For 
A; = 4, we have q{k) > 24: then |7^| < \V\ x |£| x \X\ < (1.27575 x 1.10456 x 
V2)* = o(2*). 

Thus for all A; > 4, \7l\ = o(2*) and so we obtained the desired contradiction: 
2* = |V| < 1^1 < 2" x |7^| = 2" X 0(2*) = 0(2*). 

5 Conclusion 

In our results, we heavily use the fact that the patterns are doubled. The fact 
that the patterns arc long is convenient for our proofs but does not seem so 
important. So we ask whether every doubled pattern is 3-avoidable. By the 
remarks in Section 1 and by Lemma 3, the only remaining cases are doubled 
patterns with 4 and 5 variables. Also, does there exist a finite k such that 
every doubled pattern with at least k variables is 2-avoidable ? We know that 
such a A; is at least 5 since ABCCBADD is not 2-avoidable. 
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